A Classification Approach to Boolean Query Reformulation

نویسندگان

  • James C. French
  • Donald E. Brown
  • Nam-Ho Kim
چکیده

One of the difficulties in using current Boolean-based eration of networks and online databases makes it possible information retrieval systems is that it is hard for a user, to disseminate and retrieve information in fast and easy especially a novice, to formulate an effective Boolean ways. Information updates are often simpler and cheaper query. Query reformulation can be even more difficult in electronic form than on paper. One might think that and complex than formulation since users often have electronic access would provide all the necessary infordifficulty incorporating the new information gained from the previous search into the next query. In this article, mation from a large database in a short time with little query reformulation is viewed as a classification probeffort. However, consider retrieving relevant articles from lem, that is, classifying documents as either relevant or a collection of millions of documents. A system with low nonrelevant. A new reformulation algorithm is proposed performance can easily overload the user with unneceswhich builds a tree-structured classifier, called a query tree, at each reformulation from a set of feedback docusary information or lose potentially valuable information. ments retrieved from the previous search. The query tree What most users want is quality output, not quantity. The can easily be transformed into a Boolean query. The fact is that finding the right information gets harder as query tree is compared to two query reformulation algomore information becomes available, and the big question rithms on benchmark test sets (CACM, CISI, and Medis how the user can retrieve only the information he relars) . In most experiments, the query tree showed significant improvements in precision over the two algoquested. rithms compared in this study. We attribute this The most prevalent information retrieval (IR) model improved performance to the ability of the query tree today is the Boolean retrieval model. A query is a Boolean algorithm to select good search terms and to represent expression and, in the simplest case, a search is based on the relationships among search terms into a tree structure. exact matching.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of users’ query reformulation behavior in Web with regard to Wholis-tic/analytic cognitive styles, Web experience, and search task type

Background and Aim: The basic aim of the present study is to investigate users’ query reformulation behavior with regard to wholistic-analytic cognitive styles, search task type, and experience variables in using the Web. Method: This study is an applied research using survey method. A total of 321 search queries were submitted by 44 users. Data collection tools were Riding’s Cognitive Style A...

متن کامل

Circuit Complexity Meets Ontology-Based Data Access

Ontology-based data access is an approach to organizing access to a database augmented with a logical theory. In this approach query answering proceeds through a reformulation of a given query into a new one which can be answered without any use of theory. Thus the problem reduces to the standard database setting. However, the size of the query may increase substantially during the reformulatio...

متن کامل

Effective Reformulation of Boolean Queries with Concept Lattices

In this paper we describe an approach, implemented in a system named REFINER, to combining Boolean information retrieval and content-based navigation with concept lattices. When REFINER is presented with a Boolean query, it builds and displays a portion of the concept lattice associated with the documents being searched centered around the user query. The cluster network displayed by REFINER sh...

متن کامل

Implementing High Performance Retrieval Process by Max-Score Ranking

This paper presents a comparison report of two different processes of retrieving a keyword or data’s from a given database or from a multiple databases. The process1 known as Extended Boolean Retrieval (EBR) model, it gives us an output from the database. Since EBR model implementation aspects lead to a high cost, we consider an p-norm approach to the EBR implementation. P-norm approach plays a...

متن کامل

Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIS

دوره 48  شماره 

صفحات  -

تاریخ انتشار 1997